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ABSTRACT 

The research questions whether many tests used by 
employers for employee selection do discriminate inadvertantly. 
Because sub-groups or minority groups tend to be culturally 
disadvantaged, standardized tests have many shortcomings. When used 
as selection devices standardized tests may: (1) not provide reliable 
differentiation in the range of the minority group's scores; (2) use 
predictive validity different from that for the standardization and 
validation groups; and (3) use interpretation that is quite dependent 
upon the cultural background of the majority. To eliminate cultural 
bias in testing, tests that are culture-free, culture-fair, 
culture-equivalent, and measure creativity need to be developed. The 
concepts of using "dual test standards" and "compensatory training" 
may help create new methods for employee selection. It is concluded 
that proper testing practices need to be followed by improvement in 
hiring situations. Recommendations are that: (1) job descriptions be 
examined and requirements established before tests are used; (2) 
tests be developed by reputable psychologists; (3) inflexible minimum 
scores be re-examined; (4) test scores be combined with other data on 
performance; (5) test scores be validated in the setting used; and 
(6) re-tests be offered to unsuccessful applicants with culturally 
disadvantaged backgrounds. (RL) 
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TESTING OF MINORITY GROUP APPLICANTS FOR EMPLOYMENT 



I. Introduction 

Tne Motorola case'^ and the Towar amendment to Title VII of t:.a 
Civil Rights Act of I964 Section 703 (h have dramatized the issue 
of "whether the use of general intelli.gence tests by employers as 
selection devices for hiring and promotion deprives Negroes and 
members of other minority groups of equal -employment opportunity. 
Individuals from culturally disadvantaged***backgrounds perform less 
well on these tests on the average than do applicants from middle class 
environments stnd consequently may be screened out of training programs 
and/or excluded from jobs. Differences in culture, in opportunity, 
and in experience can have a devastating effect on test perfomance. 
Since many Negroes, Mexican-Americans, Indians, and lower-class whites hav< 
not shared the middle class culture, they may perform in sm inferior 
manner on tests of general inte?JLigence, particularly paper and pencil, 
but not necessarily on performance for -which the tests are supposed to 
be predictive. 

Consistent and significant differences on mean scores are also found 
between age, sex, educational, and urban-rural groups, but the focus of 
this report is the effect of testing on the culturally disadvantaged, 

■^^•See Appendix A for the chronology of the Motorola case. 

"... nor shall it be an unla-wful employment 

practice for an employer to give and to act 

upon the results of any professionally developed 

ability test provided that such test, its 

administration or action upon the results is not 

designed, intended or used to discriminate because 

or race, color, religion, sex or national origin." 

Sec. 703(h) 
***See Appendix B. ^ 



many of whom are Negroes. This report is not concerned "with the wLlirul 
misuse of tests to discriminate such as giving tests to Negroes but not 
to \vhites , or requiring Negroes to achieve higher scores than ithites, or 
failing Negroes regardless of their actual performance. These practices 
are clearly unla-wful. The question to be considered here is -whether many 
"professionally developed ability tests'^ used by employers to select 
qualified applicants do in fa-t discriminate inadvertently . 

Authorities in the field of psychological testing have suggested 
several proposals for mitigating the effects of unintentional types of 
discrimination against minority groups. We have examined the various 
proposals and have concluded that carefiil selection and administering 
of tests and. validation of the testing instrument Kithin an industrial, 
setting, may be the most desirable means to achieve the goal of full 
utilization of the nation ^s huiuan resources. The implications of this 
affirmative conclusion are discussed from the viev/point of the Equal 
Bnployment Opportunity Commission^ private employers^ and the research 
psycho?-ogists who would have to assume the major responsibility for 
formiilating suitable standards for selection of testing programs. 
II. Types of Tests 

The major types of tests most commonly used in employee selection 
are: (l) general intelligence tests^ (2) tests of specific intellectual 
abilities, (3) knowledge, and skill tests , (4) measures of dexterity and 
coordination, and (5) inventories of personality traits. 

Intelligence tests such as the wonderlic, Stanf ord-Binst,- and Otis 

Quick-Scoring are designed primarily to measure the ability of the individ- 
ual to understand and to reason with words and numbers. Such tests are most 

6 
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useful in selection for jobs where learning from and understanding 
verbal academic material is important. 

Specific intellectual abilities tests determine potential for 
learning certain Kinds of work and for solving certain kinds of problems. 
The tests are not designed to test for a specific job, but to measure 
the skills for understanding and reasoning with words, numbers and symbols, 
visualizing of spatial relationships, word fluency, visufil 
speed ,.nd accuracy, and creative abilities. 

Knowledge and skill tests are usually specific. to a job or job family. 
Knowledge tests are designed to measure the understanding of blueprint 
reading, electronics, accounting, etc., while skill tests measure one's 
ability to type, to take dictation, to drive, etc- These tests measure 
the degree or level of knowledge or skill already attained by candidates at 
the time of the test. 

Dexterity and coordination tests measure speed and accuracy of physical 
movements. These tests must be very specific to the movements required in 
the job and are usually constructed by the employer. Examples of such 
tests are spatial and mechanical abilities, perceptual accuracy, motor 
abilities. 

Personality and interest tests are intended to indicate how a person 
typically acts and feels, arid to determine the type of activities he likes. 
Tests of this nature have been developed primarily for use in either 
vocational guidance or clinical use. It is extremely important for a 
highly trained professional psychologist to evaluate and Interpret the 
results of these tests. 
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Tests may be further categorized as aptitude versus proficiency. 
Aptitude tests are designed to measure potential while achievement 
tests measure skill level at the time of testing. 
III. How Tests Discriminate Against Minority Groups 

An aptitude test that fails to predict job performance in the same 
way for both Negroes and whites, or fails to predict job performance at all 
is not a valid test. If such a test is -weighted to differentiate between 
Negroes and whites, it is similarly invalid and simlarly discriminatory. 
Tests may be held to discriminate in the social sense if they deny equal 
opportunity for consideration. A test may operate in this manner (a) when 
scores on it tend to differentiate between identifiable sub-groups, where 
the sub-grouping itself is not a relevant selection factor, and either 
(b) scores for the lower group underpredict performance on the job when 
the standards of the upper-group are applied, or (c) scores on the test do 
not predict job performance for either group. ^ 

It is known that Negroes on the average do less well on 
paper and pencil tests than whites. Tae m^an scores for Negroes are lower 
than the mean scores for whites on most paper and pencil tests of general 
ability, intelligence, aptitude, learning ability, or overall ability. The 
distribution of scores overlap, often considerably, but the average scores 
differ significantly in most studies. 

^-Raw scores are converted to norms in order to compare an individual 
perfomiance with a "sjp^cific group. See glossary in Appendix B. 
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More research has been done on the testing of minority group 
children than workers, but the information -which has resiilted from this 
research offers insight into \jhy Negro adiilts achieve a lower mean 
score than job applicants from more middle class background. Newton S. 
Metfessel, Psychologist at the University of Southern California, in his 
research on children and youth who live in the culture of poverty* found that 
ciiltural factors such as home and family structure, personality and 
social characteristics, learning characteristics, and general school 
relationships handicap performance on tests. 

These children usually come from a home environment vihere there is 
such a paucity of objects that the child's conceptual formation develop- 
ment is adversely affected. They elso lack curiosity, and this affects 
both motivational patterns and the development of creative behavior. The 
culturally disadvantaged child is characterized by weak ego-development, 
a ,lack of self-confidence, and a negative self-concept. These conflicting 
feelings about himself frequently result in exaggerated positive and 
negative attitudes towards others.-^ ^ 

Many aspecLb of learning characteristics are affected by the 
ciilturally poor background. The culturally disadvantaged typically have 
a cognitive style which responds more to visual and kinesthetic signals 
than to oral or written stimuli. Also, these children learn more readily 
by inductive than deductive approaches. Learning experiences which move 
from the part to the whole rather than from the whole to the part a3?e 
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invariably more successfiil. Significant gaps in knowledge and uneven 

patterns of learning are typical of this t3^e of background. 

Children from the culture of poverty have had little experience in 
receiving approval for success in learning a task, an assumption on -which 
the school culture is organized* "The cycle of skill mastery which demands 
that successfiil experiences generate more motivation to perform which in 
turn guarantees levels of skill sufficient to prevent discouragement,, and 
so on, may be easily reversed in direction and end the achievement habit 
prior to its beginning."- 

In general school relationships and school characteristics, these 
children from the background of cultural deprivation are placed at a marked 
disadvantage on standardized tests, which for the most^part have been de- 
signed to test the white, middle class child. The shortcomings of the 
standardized tests when they are used with disadvantaged minority groups 
are discussed below. 

A. Reliability of Diff erentation 

Standardized tests may not provide reliable diff erentation in 
the range of the minority group's scores. The reliability coefficient 
for a particular test is strongly affected by the spread of test scores 
in the group for which the reliability is established. In general, the 
greater the spread of scores in the reliability samples, the higher the 
reliability coefficient. For many tests, there is evidence "that children 
from the lower socio-economic levels tend to have a smaller spread of 
scores than -do children from middle-income families, and such restriction 
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in the distribution of scores tends to lower reliability so far as 

u 

differentiation of measurement with such groups is concerned." 
B. Predictive Validity 

Second, the predictive validity of tests for minority groups 
may be quite different from that for the standardization and validation 
groups. Factors which may impair a testes predictive validity are: 

1. Test-related factors test taking skills, anxiety, 
motivation, speed, understanding of test instructions, degree of item 

or format novelty, examiner-examinee rapport which may affect test "scores 
but have little relation to the criterion. 

2. Complexity of criteria - It is important to recognize the 
influence of other factors, not measured by tests, which may contribute 
to criterion success. Since disadvantaged groups tend to do poorly on 
general intelligence and achievement tests of the paper and pencil type, 
one should explore background, personality, and motivation of members 

of such groups for compensatory factors, untapped by the test, which may 
be related to criterion performance.^ 

While certain aptitude and proficiency tests may have excellent 
criterion validity for some purposes, even the best of them aro unlikely 
to reflect the true capacity of underprivileged children. They tap 
abilities that have been molded by the cultural setting. The test content, 
mode of communication involved in responding to test items, and the 

motivation needed for making re'sponses are intrinsically dependent upon the 

^ ^ 6/ 
cioltural context.—' 
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C. Validity of Test Interpretation 

Third, the validity of the interpretation of tests is 
strongly dependent upon an adequate understanding of the social and 
ciiltural background of the group in question. Sources of error in test 
interpretation stemming from lack of recognition of the special features 
of culturally disadvantaged groups are: (l) de\d.ation error - tendency 
to infer maladjustment from responses which are deviant from the view- 
point of a majority culture, but which may be typical of a minority 
group. (2) simple determinant error - thinking of the test content as 
reflecting some absolute or pure trait, process, factor, or construct, 
irrespective of conditions of measurement or the population being 

studied. (3) failure barriers - requiring minority group individuals 

7/ 

to solve problems with unfamiliar tools. 

Job applicants from lower socio-economic levels may be 
characterized in contrast to their middle class counterparts as being 
less verbal, more fearful of strangers, less confident, less motivated 
toward scholastic and academic achievement, less conforming to middle 
class norais of behavior and conduct, less knowledgeable about the world 
outside their immediate neighborhood. To the extent that these sub- 
cultural differences affect test performance adversely, these persons 
may be denied the opportunity to employment and a more productive 
contribution to society. Selection instnmients often call for responses 
that are influenced by the culture of the applicant's community or 
quality of his educational opportunity. Since such tests are "culturally 
loaded" against persons from a lower socio-economic status, they may 
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operate as instruments of racial discrimination. The crucial question 
is whether employers use techniques that un'wittingly eliminate persons 
\dio might perform satisfactorily on the job. The relationship between 
test performance and cultural deprivation on the one hand? and job 
performance on the other, must be investigated for both white and 
nonwhite job applfcants. 

IV. Proposed Solutions to the Problem of Cultural Bias in Testing 

Most employers defend tests as an efficient device for choosing 
the most qualified applicants. Where Negro job applicants consistently 
score significantly below white job applicants a question should be raised 
about test scores as predictors of job performance. In an employment 
situation we would like to know whether differences between group means 
are also associated \rlth performance on the criterion. Do the factors 
that depress test performance also depress trainability or whatever 
criterion is to be predicted? Psychologists have suggested ways in which 
the effect of cultural bias inherent in many aptitude tests can be 
alleviated for minority group applicants. Few of these proposals have 
been universally acceptedt but most have been discussed in the professional 
literature on testing of minority groups and the culturally disadvantaged. 
A. Variants of "Culture-Free'^ and "Culture-Fair" Tests 

1* Culture-Free Tests . . 

One such proposal is the development of tests which are 
free of cultural bias in their content and instructions. Dr. Robert Krug, 
who has written extensively on testing of minority persons, indicates that 
one of two conditions must be met before a test can be classified as 
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"cvilture-free:" either the test items are those vM.ch all people 
of all cultiires have had equal opportunity and equal motive to learn, 
or the test items must possess complete novelty for all people of all 
cultures.-^ For all practical purposes these two conditions are almost 
impossible to meet and the idea is often rejected as unfeasible. 
Howard...Lockwood of Lockheed Corporation states that many industrial 
psychologists agree that even if such a test could be developed, it 
would be useless in personnel selection. It is impossible, he maintains, 
to avoid measuring cultural influences, and if they were completely 

2/ 

eliminated from all tests, the tests would measure, in essence, nothing. 

2. Culture-Fair Tests 

Dr. Krug, on the other hand, does not reject the idea en- 
tirely. He describes a " culture-fair " test, as a modification of the 
" c\ilture-free " idea. The assumption underlying the " culture-fv^ir " tests 
is that there exists a set of test stimuli which are equally appropriate, 
that is, equal opportunity and motive to learn, for at least two 
cultural groups.—^ Dr. Paul Schwartz, who headed an AID--sponsOred 
aptitude test development project in West Africa, has done 
most of the research in this area. A "culture-fair" test or "culture- 
common test" developed by Schwartz for Nigerian and American children 
utilized a set of fruits and vegetables which were approximately equal 
in familiarity to both cultures. 

3* Culture-Equivalent Tests 

Dr. Schwartz also developed another variant of this concept 
called "cultural- e quivalent" tests, denoting that two tests which are not 

14 



- 11 - 

identical may, in fact;, be equivalent. In this case investigations 
were iindertalcen to discover cultural counterparts of tools and machines, 
cultural" manifestations of mechanical principles, and cultural 
opportunities to acquire information of potential relevance to mechan- 
ical training. The argument of ciiltural equivalen. 'ests on the 
demonstration that tests constructed in this way have been valid predictors 
of performance in Westernized training programs 'r: si.op mechanics, 
electrical repair, and the like. Developement ^1 similar tests in this 
country is impeded by lack of knowledge concerning the culture of southern 
Negroes, northern slum-dwellers of all races, or any other identifiable 
sub-groups. Dr. Ash asserts that so-called culture-fair tests do not 
measure aptitudes or characteristics significantly related to most 

ordinary measxires of job success such as turnover, production or foreman 
12/ 

ratings . 

B. Creativity Tests 

Another approach, adopted by Dr. Newton Metfessel and 

Professor J. J. Risser, of the U. Iversity of Southern California involves 

the use of tests to measure creativity rather than traditional intelligence 

tests. The latter sample only a relatively small portion of the factors 

which are involved in intellectual potential and have placad a premium on 

Verbal comprehension and speed of response and emphasize convergent 

13/ 

thinking, or the ability to select the one correct answer. 
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Creativity tests, on the other hand, stress divergent thinking 
or the ability to create new or original answers. They are, according 
to Metfessel, more suitable for the testing of the ciolturally disadvan- 
taged and certain ethnic groups whose command of language is not 
highly developed. 

These tests utilize the most common and familiar of objects in 

order to sample the testee^s ability to recognize problems, and his 

originality, flexibility, and fluency of thinking. Tasks include 

suggesting improvements in a familiar device such as a telephone, or 

thinking of problems that might occur in the use of an object such as a 

-oandle. One test requires the subject to list as many uses as he can 

14/ 

for a broom handle. 

The tests are scored simply on the nxmber of acceptable answers 

given by the subject. They seem to be as effective in predicting 

academic success as traditional intelligence tests and, probably, would 

•be as effective as the latter in predicting job performance. 

C. Differential Selection Among Applicants From Different 
Socio-Economic Ethnic Backgrounds 

It has been proposed that, since prediction equations for job 
performance for most tests .currently in use have been based on the 
performance of whites, different standards (separate test norms^ 
conversion tables, prediction weights, etc.) be employed' for Negores 
and other culturally disadvantaged groups. This approach involves a 
technique known as the moderator variable . Applicants for a given job 
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for different ethnic groups do not mean a lowering of standards because 

the standards "which count are standards of performance ori'the job, not 

the selection standards. Equally qualified persons may be selected 

from various ethnic groups by applying the standards -which are appro- 

15/ 

priate to each group.— ^ 

Lockwood has proposed the use of "cultural exposure" 
as a moderator variable. Examinees should be grouped homogeneously 
as to cultural exposure and these groups treated separately in validity 
studies. Cultural exposure is defined as the material things to which 
a person has been exposed and the attitudes to -which he has been exposed 
and "which he has acquired. Research would lead to a better identifi- 
cation of the culturally disadvantaged and to the utilization of their 

abilities through a refinement in .-prediction of training and occupational 
16/ 

success. — ^ 

A major investigation is under way by Dr. Richard Barrett to 
determine if the division of applicants into sub-groups improves the 
accuracy of prediction for members of both groups. If selection is 
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L3, Dreger, Ralph M. , and Miller, Kent S. , "Recent Research in 

Psychological Comparisons of Negroes and Whites in the United States," 
(Presented at Southeastern P svcholoSical Association, Atlanta, Ga., 
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improved by applying different procedures to the high and low socio- 
economic groups, then, the more, talented would benefit, .regardless, of race. 
"It may also happen that dividing the group of applicants on the basis 
of race may lead to improved accuracy of predictions for members of 
both races. Such a result has far reaching implications for fair 
employment practices because failure to treat the tm) races separately 

• rfould, if current policing jwere followed, lead to discrimination against 

17/ 

the more talented Negroes." — ^ 

The overwhelming evidence is that the cultural background of 
the Negro in America today is so different from that of the white that 
his performance during the selection process can reasonably be expected 
to be different. It may be difficult to find an adequate sample of 
Negroes in most occupations in order to develop separate and suitable 
prediction equations for them. Lockwood also cautions against the use of 
a lower minimum score or separate standards of test performance for 
Negroes since it might tend to perpetuate the idea of race differences 
or race inferiority. 

D. Dual Test Standards and Compensatory Training 

The concept of a "dual standard" has some support among 
psychologic uS, Ash cites the* work of Dr. Kenneth B. Clark of the 
City University of New York. Clark's work suggests that culturally 
deprived people who score low on trists may tend to overachieve on the 
job. In studying the college performance of students who scored low 
on college entrance tests ^ Clark found that for students from non- 
deprived omironments, the tests were good predictors^ and low college 
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entrance test scores were accurate indicators of poor grades. On 
the other hand, students coining from deprived environmencs did signifi- 
cantly better in college than would have been predicted from the tests.' 



Stores in Detroit, Michigan, indicates that a lowering of required 
test scores i-dll not necessarily result in a lower quality of job 
performance. The Federal Department Stores took l6 young people from 
ciilturally and economically deprived ii-reas, all of whom had failed 
standard employment tests and were classified as "unemployable", and 
put them through a 10-^eek special training program. All l6 subsequently 
were employed, 14 at. Federal and two elsewhere. The record of per- 
formance of all 14 employees at Federal exceeded what was predicted by 
standard sales aptitude tests. Some exceeded the company's rainimiam 

12/ 



performance standards ^ov new employees by "unbelievable margins." 



one of the first of its kind in offering compensatory training for 
individuals with low test scores, the concept cf "double-standard" has 
had wide acceptance for years in the fairly common practice of maintaining 
different norms for the sexes. Several popular tests which offer different 
sex norms are the Bennet, The Wonderlic, the Minnesota Paper Form Board, 
and the Thur stone Temperament Schedule. 



tests are undoubtedly of environmental origin. Girls, are expected to 

score lower than boys on tests of mechanical information. It is also 

*Re-test results one year later for the ten trainees still employed by 
Federal showed no significant changes in the scores as a group. 



An experijaental training program nan by the Federal Department 




Although the Federal Department Stores experiment is considered 



It is generally agreed that some of these sex differences on 
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expected that girls will perform less effectively on tasks for which 

the Mechanical Information test is a predictor. This, however, does 

not prevent many companies from employing women in manufacturing tasks 

20/ 

which require mechanical ability where they perform satisfactorily. — 

On the basis of these examples, it appears that a ^'double- 

standard'' can be justified in some circumstances , though a double standard 

in job performance and hiring of less qualified applicants is usually 

rejected as not being effective. If it can be demonstrated that score X 

for Group A and Score K-k for Group B are associated with identical levels 

of performance on the job, then an employer might reasonably consider 

21/ 

adopting a more flexible attitude toward test scores. — 

E. Intensification of Recruitment - While there are significant 
differences in average performance, there is a considerable overlap in 
the distribution of test scores of whites anu Negroes. It has been 
proposed, on the basis of this observation, that employers who wish to 
maintain their present standard of performance on their pre-employment 
tests, can increase their number of Negro employees by intensifying 
recruitment among Negroes in order to identify those whose test perfor- 
mance is equal to that of acceptable white applicants. Although this 
approach has merit in that it could provide employment for Negroes who 
are qualified but who do not apply for jobs in companies where they 
assume discrimination is practiced, it is not a solution to the testing problem 
It ducks the question of the fairness of tests to those who fail because of 



20 



- 17 - 

cultural disadvantage, and it will not provide enough additional workers 

22/ 

to satisfy present and fufciire labor needs. — ^ 

F. Use of Test Scoras as Only One Indicator - One other practical 
solution similar in many respects to the "double-standard" is to use 
test scores as only one indicator among others in the hiring decision, 
•with a clear awareness that, where the applj,cant^ has not 

shared in the predominant middle-class verbal culture, the test score 
significantly underestimates his potential. A difference of one point 
more or less cannot be expected to determine if an applicant -will fail 
or succeed on the job. Other personal characteristics such as achieve- 
ment, motivation, and dependability may be just as significant indicators 
of successful job performance, and they usually can be identified in each 
cultural group. 

G. Proper Testing Practices - Along vath adoption of a more flexible 
attitude toward test scores, the most immediate improvement can be 
accomplished by an emphasis by the employer on proper testing practices. 

(l) The employer could reconsider the relevance of the 
qualifications for employment to the specific job tasks required by his 
company. Many of these requirements are stated in terms of some general- 
ized stereotypes, such as high school graduat.:, high IQ, or potential to 
advance to higher level jobs, and are quite extraheous to the requirements 
of that job. Tests sliould be professionally chosen to fit the distinctive 
features of both the industry and the background, education and other 
characteristics of the successful work force. It is unreasonable to insist 
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that all iDwei:. level workers, have.. potential . for > supervisory positions. An 
eiuployer may eventually find that by adopting a more reasonable set 
of qualifications for each job^, he \dll have access to a considerably 
larger source of workers who can perform capably and who will present 
him with fewer problems of employee frustration or labor turnover. 

(2) Selection tests should be developed by reputable profes- 
sional psychologists who are competent in' conducting testing programs 
in an industrial setting. 

(3) Pre-employment tests should be administered by personnel 
who are properly trained not only in the technical details of giving 
tests, but also in the orientation and handling of people in the testing 
situation. Members of disadvantaged groups tend to be particularly 
sensitive to any mannerisms that might be considered antagonistic, 
sarcastic, or condescending, and test administrators should be aware 

of this and be able by their behavior to alleviate a certain 
amount of test anxiety. A personnel manager at a recent testing 
conference complained that the number of Negro applicants for Jobs 
in his company had fallen off by 80 percent after the company recently 
instituted a pre-employment testing program. 

(4) A policy of re-testing "failure" candidates may gain for an 
employer many good employees who otherwise would have been eliminated 
by the first test. Many candidates, particularly members of minority 
groups, regard testing as a threatening situation and do not perform 

as well as they could. A second test would provide a more accurate 

^University of Michigan Testing of Minority Group Applicants, .January 26, 
1966. 
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indication of the true capability of a person who is less experienced 
with testing situations and who may have been intimidated by his first 
experience. 

(5) Finally, the most important principle is validation 
of tests in order to confirm the relationship between test scores and 
on-the-job performance. There is general agreement that tests should 
not be used for a group which differs from the validation group. 
Validity is relative both to the criterion to be predicted and to the 
group for which the prediction is to be made. Very few employers have 
validated their testing instruments. In a recent survey by the 
University of Wisconsin Industrial Relations Research Center, 152 companies 
which apply testing techniques were canvassed and only 7 percent reported 
that all their tests had been validated locally against on-the-job perfor- 
mance measures. Nearly 60 percent had validated none of their tests. The 

23/ 

remainder reported that some but not all of their tests were validated. 

Dr. Warren Ketcham, University of Michigan psychologist and 
Vice President of Psychodynamics Research and Associates, has suggested that 
within company norms should be used exclusively. This only requires that 
an applicant perform on tests as well as or better than persons who have 
done or are presently doing the job satisfactorily. The norm tables should 
then be used to rank applicants as sub-standard, low-average, average, 
high-average, or superior. — 

From recent discussions with research psychologists attached to large 
industrial concerns, it appears that many companies are developing ability 



23 



- 20 - 

tests which will measure the essentials required for training or 
employment, while keeping at a minimum the relevant aspects of culture. 
For a number of reasons, these findings may never be released for general 
consumption. One of the responsibilities of the Commission will be to 
encourage this type of research by the psychological profession. If the 
purpose of tests is to uncover talent and potential, irrespective of 
label, surely the Commission could not advocate a more commendable policy. 
V. United States As A Model Employer 

If the Equal Employment Opportunity Commission establishes basic 
guidelines on testing of minority group applicants, including a provision 
on validation of tests, it will require private employers to satisfy 
certain standards which the United States government, as a civilian 
employer, for the most part does not meet. 

The U. S. government has set a fine example in its standardized test- 
ing program for the military where these tests have been completely 
validated. Testing in the Armed Forces serves a number of major programs, 
two of which are (1) to identify the number of personnel required in each 
skill and professional category, and (2) to identify each individual for 
training, upgrading, and utilization to his highest potential. 

In order to maintain validity, test development activities are mainly 

serviced by professional job analysts, subject matter specialists, and test 

psychologists and validated in the working area. This systematic approach 

is essential to assure that the tests sample specific job functions in 

*0f some interest is the fact that the United States Employment Service 
has recently undertaken a program to develop aptitude measures that can be 
used to evaluate potential for literacy training, vocational training and 
occupational potential of the educationally deficient. Much' of the research 
is designed to improve the General Aptitude Test Battery (GATB) . 
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direct proportion to the importance of those functions to the job. As 
a result, job analysis provides not only a basis for test construction, 
selection and training, but also a means for increasing productivity 
and facilitating work. 

VI. Recommendations for Testing Guidelines 

The following recommendations are designed as a guide to help 
employers establish objective standards for selection, screening, and 
promotion of workers. These procedures should ensure that all qualified 
applicants are given equal opportunity for employment. 

1. Job descriptions should be examined and their critical 
requirements established before tests are selected for screening 
applicants. 

2. Tests used should be those developed by reputable psychologists. 
Such tests should be administered by professionally qualified personnel 
who have had training in occupational testing in an industrial setting. 

3. Rigidly inflexible minimum scores should be re-examined in light 
of the considerable research under way on differential selection. 

4. Test scores must be considered as only one source of information, 
and must be combined with other available data on performance such as 
motivation, leadership and organizational experience, self-sufficiency, 
and dependability. 

5. Tests should be validated within the setting where they will 
be used. Validation should be for as many separate groups as possible 
in preference to one large heterogeneous group. 
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6. It may be advisable for employers who deal with applicants 
from culturally disadvantaged backgrounds to offer re-tests to candidate 
who are unsuccessful on their first try, since these people are less 
familiar with testing situations and may not perform as well as they are 
able. 
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APPENDIX A 
CHRONOLOGY OF TRE MOTOROLA CASE 

I. July 15, 1963 - Leon Myart, a Negro, applied for a job as a 
television phaser and analyzer at the Franklin Park plant of 
Motorola, Inc. Myart took a five minute intelligence test (General 
Ability Test No. 10), was interviewed, and was sent home without 
being told whether he qualified for employment. 

II. July 29, 1963 - Failing to receive a job offer, Myart filed a 
complaint with the Illinois Fair Employment Practices Commission and 
the President's Committee on Equal Employment Opportunity alleging 
that his not being hired was due to racial discrimination. 

III. January 27-28, 1964 - Hearing of the Motorola case before hearing 
examiner Robert Bryant of the Illinois Fair Employment Practices 
Commission. 

IV. February 26, 19 64 - The hearing examiner directed that Myart be 
offered a job, that test No. 10 should no longer be used, and that 
any new test developed in its place should "reflect and equate 
inequalities and environmental factors among the disadvantaged and 
culturally deprived groups." He argued that the test had been normed 
on "advantaged groups" and did not "lend itself to equal employment 
opportunity to qualify for the hitherto culturally deprived and 
disadvantaged groups." 
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V. April 18, May 25, July Ih-lS. 1964 - Review of the Motorola 
case before the full Commission, 

VI. November 18, 1964 - The Commission issued its xmanimous decision, 
finding that Miyart had been denied employment because of his race 
and while not supporting the order to hire J^jrart directed that 

he be compensated one thousand dollars. 

VII. April 27, 1965 - Illinois Circuit Court decision on appeal of 
Motorola. The ruling requiring Motorola to pay I^jrart one 
thousand dollars was reversed, but the Commission's findings 
on discrimination were upheld. 

VIII > November 11. 1965 - Case argued before the Illinois Supreme 
Court. 

IX. Mkrch 24, 1966 - Illinois Supreme Court reversed ^ he judgment of 
the. circuit court on grounds that the alleged unfair employment 
practice was not established by a preponderance of the evidence. 
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APPENDIX B 

GLOSSARY OF SPECIAL TERMS 



Criterion - A standard that provides a basis for evaluating the 
validity of a test. 

Cultural bias - Propensity of a test to reflect favorable or un- 
favorable effects of certain t3^es of cultural backgrounds 

Culture-fair test - A test yielding results that are not culturally 
bv.jased. 

Culture-free test - A test yielding results ^.hat are not influenced 
in any way by cultural background factors. 

Norms - Statistics that depict the test performance of specific groups 
Grade, age, and percentile are the most common types of 
norms . 

Reliability - The degree of consistency, stability, or dependability 

of measurement afforded by a test. 
Validity - The ex±ent to which a test measures the trait for which it 

is designed, or for which it is being used, rather 

than some other trait. 
Psychological test - An observation of a sample of human behavior made 

under standard, controlled conditions which results in a 

linear evaluation called a score. 
Culturally disadvantaged - Groups which do not have full participation 

in American society because of low incomes, substandard 

housing, poor education, and other "atypical" environmental 

experiences. 
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